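I'd like to recommend a free online e-book, Natural Language Processing with Python, which uses the Natural Language Toolkit (NLTK; free, open source, written in Python) for text analysis. It is especially well suited to students who are new to computational linguistics. It does not seem to include good Chinese word segmentation, though. And of course I have to mention Professor Dan Jurafsky; search for his name and you can find his lecture notes.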
Here is a brief look at the outline:
(The book's main site: http://www.nltk.org/)
Natural Language Processing with Python
— Analyzing Text with the Natural Language Toolkit
Steven Bird, Ewan Klein, and Edward Loper
1. Language Processing and Python (extras)
2. Accessing Text Corpora and Lexical Resources (extras)
3. Processing Raw Text
4. Writing Structured Programs (extras)
5. Categorizing and Tagging Words
6. Learning to Classify Text (extras)
7. Extracting Information from Text
8. Analyzing Sentence Structure (extras)
9. Building Feature Based Grammars